| Number of Variables | 4 |
|---|---|
| Number of Rows | 80230 |
| Missing Cells | 0 |
| Missing Cells (%) | 0.0% |
| Duplicate Rows | 0 |
| Duplicate Rows (%) | 0.0% |
| Total Size in Memory | 32.6 MB |
| Average Row Size in Memory | 425.5 B |
| Variable Types |
|
| idx is skewed | Skewed |
|---|---|
| docstring_tokens has a high cardinality: 56780 distinct values | High Cardinality |
| code_tokens has a high cardinality: 65851 distinct values | High Cardinality |
| url has a high cardinality: 80230 distinct values | High Cardinality |
| url has all distinct values | Unique |
numerical
| Approximate Distinct Count | 79560 |
|---|---|
| Approximate Unique (%) | 99.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Memory Size | 1283680 |
| Mean | 202457.1517 |
| Minimum | 11 |
| Maximum | 1.405e+06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negatives | 0 |
| Negatives (%) | 0.0% |
| Minimum | 11 |
|---|---|
| 5-th Percentile | 13123.45 |
| Q1 | 48332.25 |
| Median | 87832.5 |
| Q3 | 134752.75 |
| 95-th Percentile | 959460 |
| Maximum | 1.405e+06 |
| Range | 1.405e+06 |
| IQR | 86420.5 |
| Mean | 202457.1517 |
|---|---|
| Standard Deviation | 287574.8404 |
| Variance | 8.2699e+10 |
| Sum | 1.6243e+10 |
| Skewness | 2.1703 |
| Kurtosis | 3.929 |
| Coefficient of Variation | 1.4204 |
categorical
| Approximate Distinct Count | 56780 |
|---|---|
| Approximate Unique (%) | 70.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory Size | 10573857 |
| Mean | 65.894 |
|---|---|
| Standard Deviation | 50.4707 |
| Median | 54 |
| Minimum | 2 |
| Maximum | 1108 |
| 1st row | ['Python3', 'imple... |
|---|---|
| 2nd row | ['Stores', 'the', ... |
| 3rd row | ['Traverse', 'the'... |
| 4th row | ['Stores', 'the', ... |
| 5th row | ['Traverse', 'the'... |
| Count | 2625301 |
|---|---|
| Lowercase Letter | 2519157 |
| Space Separator | 575120 |
| Uppercase Letter | 106144 |
| Dash Punctuation | 5712 |
| Decimal Number | 23133 |
categorical
| Approximate Distinct Count | 65851 |
|---|---|
| Approximate Unique (%) | 82.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory Size | 18590390 |
| Mean | 151.7518 |
|---|---|
| Standard Deviation | 144.0961 |
| Median | 105 |
| Minimum | 0 |
| Maximum | 3233 |
| 1st row | ['def', 'maxPresum... |
|---|---|
| 2nd row | ['X', '=', 'max', ... |
| 3rd row | ['for', 'i', 'in',... |
| 4th row | ['Y', '=', 'max', ... |
| 5th row | ['for', 'i', 'in',... |
| Count | 3903576 |
|---|---|
| Lowercase Letter | 1918212 |
| Space Separator | 1732826 |
| Uppercase Letter | 1985364 |
| Dash Punctuation | 24719 |
| Decimal Number | 177121 |
categorical
| Approximate Distinct Count | 80230 |
|---|---|
| Approximate Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory Size | 6292016 |
| Mean | 13.4247 |
|---|---|
| Standard Deviation | 0.6503 |
| Median | 13 |
| Minimum | 10 |
| Maximum | 15 |
| 1st row | 10005-Python-1 |
|---|---|
| 2nd row | 10005-Python-2 |
| 3rd row | 10005-Python-3 |
| 4th row | 10005-Python-4 |
| 5th row | 10005-Python-5 |
| Count | 481380 |
|---|---|
| Lowercase Letter | 401150 |
| Space Separator | 0 |
| Uppercase Letter | 80230 |
| Dash Punctuation | 160460 |
| Decimal Number | 435226 |